9 research outputs found

    Nutrient Estimation from 24-Hour Food Recalls Using Machine Learning and Database Mapping: A Case Study with Lactose

    No full text
    The Automated Self-Administered 24-Hour Dietary Assessment Tool (ASA24) is a free dietary recall system that outputs fewer nutrients than the Nutrition Data System for Research (NDSR). NDSR uses the Nutrition Coordinating Center (NCC) Food and Nutrient Database, both of which require a license. Manual lookup of ASA24 foods into NDSR is time-consuming but currently the only way to acquire NCC-exclusive nutrients. Using lactose as an example, we evaluated machine learning and database matching methods to estimate this NCC-exclusive nutrient from ASA24 reports. ASA24-reported foods were manually looked up into NDSR to obtain lactose estimates and split into training (n = 378) and test (n = 189) datasets. Nine machine learning models were developed to predict lactose from the nutrients common between ASA24 and the NCC database. Database matching algorithms were developed to match NCC foods to an ASA24 food using only nutrients (“Nutrient-Only”) or the nutrient and food descriptions (“Nutrient + Text”). For both methods, the lactose values were compared to the manual curation. Among machine learning models, the XGB-Regressor model performed best on held-out test data (R2 = 0.33). For the database matching method, Nutrient + Text matching yielded the best lactose estimates (R2 = 0.76), a vast improvement over the status quo of no estimate. These results suggest that computational methods can successfully estimate an NCC-exclusive nutrient for foods reported in ASA24

    Effect of Manual Data Cleaning on Nutrient Intakes Using the Automated Self-Administered 24-Hour Dietary Assessment Tool (ASA24)

    No full text
    BackgroundAutomated dietary assessment tools such as ASA24® are useful for collecting 24-hour recall data in large-scale studies. Modifications made during manual data cleaning may affect nutrient intakes.ObjectivesWe evaluated the effects of modifications made during manual data cleaning on nutrient intakes of interest: energy, carbohydrate, total fat, protein, and fiber.MethodsDifferences in mean intake before and after data cleaning modifications for all recalls and average intakes per subject were analyzed by paired t-tests. The Chi-squared test was used to determine whether unsupervised recalls had more open-ended text responses that required modification than supervised recalls. We characterized food types of text response modifications. Correlations between predictive energy requirements, measured total energy expenditure (TEE), and mean energy intake from raw and modified data were examined.ResultsAfter excluding 11 recalls with invalidating technical errors, 1499 valid recalls completed by 393 subjects were included in this analysis. We found significant differences before and after modifications for energy, carbohydrate, total fat, and protein intakes for all recalls (P < 0.05). Limiting to modified recalls, there were significant differences for all nutrients of interest, including fiber (P < 0.02). There was not a significantly greater proportion of text responses requiring modification for home compared with supervised recalls (P = 0.271). Predicted energy requirements correlated highly with TEE. There was no significant difference in correlation of mean energy intake with TEE for modified compared with raw data. Mean intake for individual subjects was significantly different for energy, protein, and fat intakes following cleaning modifications (P < 0.001).ConclusionsManual modifications can change mean nutrient intakes for an entire cohort and individuals. However, modifications did not significantly affect the correlation of energy intake with predictive requirements and measured expenditure. Investigators can consider their research question and nutrients of interest when deciding to make cleaning modifications

    Surveying Nutrient Assessment with Photographs of Meals (SNAPMe): A Benchmark Dataset of Food Photos for Dietary Assessment

    No full text
    Photo-based dietary assessment is becoming more feasible as artificial intelligence methods improve. However, advancement of these methods for dietary assessment in research settings has been hindered by the lack of an appropriate dataset against which to benchmark algorithm performance. We conducted the Surveying Nutrient Assessment with Photographs of Meals (SNAPMe) study (ClinicalTrials ID: NCT05008653) to pair meal photographs with traditional food records. Participants were recruited nationally, and 110 enrollment meetings were completed via web-based video conferencing. Participants uploaded and annotated their meal photos using a mobile phone app called Bitesnap and completed food records using the Automated Self-Administered 24-h Dietary Assessment Tool (ASA24®) version 2020. Participants included photos before and after eating non-packaged and multi-serving packaged meals, as well as photos of the front and ingredient labels for single-serving packaged foods. The SNAPMe Database (DB) contains 3311 unique food photos linked with 275 ASA24 food records from 95 participants who photographed all foods consumed and recorded food records in parallel for up to 3 study days each. The use of the SNAPMe DB to evaluate ingredient prediction demonstrated that the publicly available algorithms FB Inverse Cooking and Im2Recipe performed poorly, especially for single-ingredient foods and beverages. Correlations between nutrient estimates common to the Bitesnap and ASA24 dietary assessment tools indicated a range in predictive capacity across nutrients (cholesterol, adjusted R2 = 0.85, p 2 = 0.21, p < 0.05). SNAPMe DB is a publicly available benchmark for photo-based dietary assessment in nutrition research. Its demonstrated utility suggested areas of needed improvement, especially the prediction of single-ingredient foods and beverages
    corecore